Hao AI Lab

mentions: 1 · type: Organization

// recent coverage: 1 mention

2026-06-18 04:00 · arxiv.org · large-language-models

JetFlow: Breaking the Scaling Ceiling of Speculative Decoding with Parallel Tree Drafting

Researchers from Hao AI Lab introduced JetFlow, a speculative decoding framework that breaks the scaling ceiling of autoregressive LLMs by combining one-forward drafting efficiency with branch-wise ca…

// co-occurs with top 5 entities

JetFlow (1) · Qwen3 (1) · MATH-500 (1) · H100 GPU (1) · vLLM (1)